CLAIMS 



What is claimed is: 



1 1. A method for document classification comprising: 

2 analyzing textual and graphical properties of an electronic document; 

3 generating a classification of the document based on the textual and 

4 graphical properties; and 

5 storing the electronic document in a pre-existing document hierarchy 
6 . based on the classification. 

1 2. The method defined in Claim 1 where the document hierarchy 

2 comprises a hierarchy of docimients mirroring in a similar fashion an 

3 organization in a pre-existing memory storing documents. 

1 3. The method defined un Claim 2 when the pre-existing memory 

2 comprises a hard disk. \ 

1 4. The method of claim 1, wherein analyzing textual properties of 

2 an electronic document comprises: \ 

3 determining characteristic words of the document; 

4 determining a frequency for each characteristic word; and 

5 building a frequency table based on the frequency associated with each 
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1 5. XThe method of claim 1, wherein analyzing graphical properties 

2 of an electronicMocument comprises: 

3 v determining\a point set corresponding to the electronic document, 

4 wherein points of theypoint set correspond to end points of lines; 

5 determining a density of points within the point set; 

6 generating a document profile based, at least in part, on the density of 

7 points within the point set\ 

1 6. The method of claim 1, wherein generating a classification of the 

2 document based on the textual and graphical properties comprises combining 

3 results from the textual and graphical analysis using a Borda Count. 



L 



1 7. The method defined >^\Q#im 1 further comprising building the 

2 pre-existing directory structure by building hierarchy of documents based on a 

3 user's hard drive. 

1 8. The method defined in Claim 1 further comprising building the 

2 pre-existing directory structure by extracting\graphical and text features from 

3 documents in a directory-based memory to obtain a 
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9. software product including a machine-readable medium 
having stored thereon sequences of instructions, which, when executed by a 
processor, cause ink processor to: 

analyze textual and graphical properties of an electronic document; 

generate a classification of the document based on the textual and 
graphical properties; and\ 

store the electronic document in a pre-existing directory structure based 
on the classification. \ 

10. The machine-readable medium of claim 9, wherein the 
sequences of instructions that cause the processor to analyze textual properties 
of an electronic document further comprise sequences of instructions that 
cause the processor to: / \ 

determine characterisuc^ord^Tof the document; 
determine a frequency for each characteristic word; and 
build a frequency table based on me frequency associated with each 
characteristic word. \ 

11. \The machine-readable medium of claim 5, wherein the 
sequences of instructions that cause the processor to analyze graphical 
properties of an\electronic document further comprise sequences of 
in s tr uctions t hat cau r. r Ihr p r n T'ss rn I n ' \ 
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5 determine a point set corresponding to the electronic document, 

6 wherein points\pf the point set correspond to end points of lines; 

7 determines density of points within the point set; 

8 generate a document profile based, at least in part, on the density of 

9 points within the point set. 

1 12. The machine-readable medium of claim 5, wherein the 

2 sequences of instructions that cause the processor to generate a classification 

3 of the document based oA the textual and graphical properties further 

4 comprises sequences of insm^tions that cause the processor to combine 

5 results from the textual anaWaphical analysis using a Borda Count. 



1 13. A method for document classification comprising: 

2 analyzing documents in a pre-existing document directory structure to 

3 determine an organization of the pre-existing document directory structure; 

4 generating a mirror directory structure based on the pre-existing 

5 document directory structure; and\ 

6 placing a document in the mirror directory structure based on the 

7 organization of the pre-existing document directory structure, results of 

8 textual analysis of the document, and results of graphical analysis of the 

9 documen t: ' 
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1 14. The method of claim 13, wherein analyzing the pre-existing 

2 document directory structure to determine the organization further 

3 comprises: 

4 ^ recursively descending the pre-existing document directory structure; 

5 generating^ lis\ of directories in the pre-existing document directory 

6 structure; 

7 examining files in directories of the pre-existing document directory 

8 structure to determine content; and 

9 examining the content of the files to determine the organization of the 
10 directories in the pre-existing cfpcument directory structure. 

1 15. The method^of claim 13 wherein the pre-existing document 

2 directory structure is a hard disk directory structure. 



1 
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16. The method of claim 13\wherein generating a mirror directory 
structure based on the pre-existing document^ directory structure comprises 
generating a 4pcument directory structure having a set of directories and 
relationships equivalent to the pre-existing document directory structure. 



1 17. The method of claim 13 wherein placing a document in the 

2 mirror directory structure based on the organization of the pre-existing 

3 d ^mm p nt Hirnrtory s trnrtnrp r onnprisps- . 

- \ 
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4 determining a primary directory in the pre-existing document directory 

5 structure in wnich the document should be placed based on the organization 

6 of the pre-existing document directory structure; and 

7 storing the document in a primary corresponding directory in the 

8 mirror directory structure that corresponds to the primary directory in the 
A/ 9 pre-existing document directory structure. 



i.u 
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18. The method^ df claim 17 further comprising: 
determining a secorWary directory in the pre-existing document 
directory in which the document should be placed based on the organization 
of the pre-existing document dVectbry structure; and 

storing the documerrtina Corresponding secondary directory in the 
mirror directory structure ^mtco^esponds to the secondary directory in the 
pre-existing document directgry structure. 



1 19. A computer-readable medium having stored thereon sequences 

2 of instructions which, when executed by a processor, cause the processor to: 

3 analyze a pre-existing document directory structure to determine an 

4 organization of the pre-existing document directory structure; 

5 generate a mirror directory structure based on the pre-existing 

6 document directory structure; and 

7 place a document in the mirror directory structure based on the 

8 organization of the pre-existing document directory structure. 
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1 20. The computer-readable medium of claim 19, wherein the 

2 sequences of instructLs that cause the processor to analyze a pre-existing 

3 document directory structure to determine an organization of the pre-existing 

4 document directory stru\ture further comprise sequences of instructions that 

5 cause the processor to: 

6 recursively descending the pre-existing document directory structure; 

7 generating a list of directories in the pre-existing document directory 

8 structure; 

9 examining files in directories of the pre-existing document directory 

10 structure to determine content; 

1 1 examining the content A Ae files to determine the organization of the 

12 directories in the pre-existingU^^nt directory structure. 



1 21. The computer-readable Vnedium of claim 19, wherein the 

2 sequences of instructions that cause tAe processor to generate a mirror 

3 directory structure based on the pre-exlting document directory structure 



6 



1 



4 further comprise sequences of instruct^ that cause the processor to 

5 generate a document directory structure having a set of directories and 
relationships equivalent to the pre-existink document directory structure. 



22. The computer-readable mediurA of claim 19, wherein the 
2 sequences of instructions that cause the proceLr to place a document in the 
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3 mirror directory structure based on the organization of the pre-existing 

4 document directory structure further comprise sequences of instructions that 

5 cause the processor to: 

6 determine a primary directory in the pre-existing document directory 

7 structure in which me document should be placed based on the organization 

8 of the pre-existing document directory structure; and 

9 store the document in a primary corresponding directory in the mirror 

10 directory structure thai corresponds to the primary directory in the pre- 

1 1 existing document directory structure. 

1 23. The computer-readable medium of claim 22 further comprising 

2 sequences of instructions tnkt cause the processor to: 

3 determine a secondary afrectory in the pre-existing document directory 

4 in which the document should be placed based on the organization of the pre- 

5 existing document directory structure; and 

6 store the document in a\ corresponding secondary directory in the 

7 mirror directory structure that corresponds to the secondary directory in the 

8 pre-existing document directory\structure. 

1 24. An apparatus comprising: 

2 means for analyzing a pre-existing document directory structure to 

3 determine an organization of the pre-existing document directory structure; 
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meanslpr generating a mirror 



directory structure based on the pre- 



existing document directory structure; and 

means teU a document in the mirror directory structure based 



on the 



organiza.ion\f the pre-existing document directory structure. 



, 25. The appaitus of claim 24, wherein means for analyzing the pre- 

2 existing document directory structure to determine the organization further 

comprises: 

means for r e cursively\de-nding the pre-existing document directory 

structure; 

means for generating A> of directories in the pre-existing document 

7 directory structure; 

means for examining files I directories of the pre-existing document 

9 directory structure to determine coWent; and 

means for examining the con\en, of the files to determine the 
„ organization of the directories m the^re-existing document directory 
12 structure. 



3 
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26. The apparatus of claim 24, Wherein meanS for * 

u a ™ tV.P nte-existing document directory 
mirror directory structure based on the pte existing 

for generating \ document directory structure 



structure comprises means 



4 having a set of directories 



and relationships\equivalent to the pre-existing 
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means 



27. The apparatus of claim 24, wherein means for placing a 
document in .he mirror directory structure based on the organization of the 
pre-existing documentWectory structure comprises: 

for determW a P™^ direC ' 0ry *" P^ 8 " 118 
document directory stru-re in which the document should be placed based 
on the organization of their .existing document directory structure; and 

m eans for storing th\ document in a primary corresponding directory 
ta the mirror directory strucLe that corresponds to the primary directory in 
the pre-existing document directory structure. 



1 



28. The apparatus ft Aairri 27 further comprising: 
m eans for determmi^Wndary directory in the pre-existing 
document directory in which theVocument should be placed based on the 

organization of the pre-existing 

means for storing the document in a corresponding secondary directory 
in the mirror directory structure tha\ corresponds to the secondary directory 
in the pre-existing document directory structure. 



1 
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29. A document processing sVstem comprising: 

a document scanning device; 

device couple\l to the document scanning device, 



a document storage 



wherein the document storage 



device is organized as a document directory 
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5 structure having multiple directories, and further wherein the document 

6 storage device has a mirror directory structure having multiple directories 

7 organized based on the document directory structure; and 

8 a processor couplea to the document scanning device and to the 

9 document storage device, wherein the processor analyzes a document 

10 scanned by the document scanning device to determine a directory in the 

11 document directory structure! in which the document should be placed and 

12 stores the document in a corresponding directory in the mirror directory 
O 13 structure. 



1 30. The document processing system of claim 29 wherein the 

2 processor determines a secondar^ director in the document directory structure 

3 in which the document should bel placed and stored the document is a 

4 corresponding secondary directory in\the mirror directory structure. 

1 31. The document processing system of claim 29 wherein the 

2 processor analyzes files stored in the document directory structure to 

3 determine content and generates an organizational description of the 

4 document directory structure based on the analysis. 



1 32. The document processing system of claim 29 wherein the 

2 document is analyzed based on image and textual content. 



\ 
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